MAXIMAL THEOREMS FOR THE DIRECTIONAL HILBERT 
TRANSFORM ON THE PLANE 
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Abstract. For a Schwartz function / on the plane and a non-zero u S define 
the Hilbert transform of / in the direction v to be 

Hv f{x) = p.v. [ f{x - vy) — 
Jr V 

Let ^ be a Schwartz function with frequency support in the annulus 1 < |C| < 2, 
and Cf = C*f- We prove that the maximal operator sup|^,|^]^|H,j Cf\ maps 
into weak L^, and into for p > 2. The estimate is sharp. The method 
of proof is based upon techniques related to the pointwise convergence of Fourier 
series. Indeed, our main theorem implies this result on Fourier series. 



1. Introduction, Principal Theorem 



Our interest is in the directional Hilbert transform applied in a choice of directions 
of the plane. Thus, for v eM? — {0}, set 

fix) = p.v. [ f{x-vy) — 

Jr y 

This definition is independent of the length of v, and below we shall only concern 
ourselves with |f | = 1. Let C be a Schwartz function with frequency support in the 
annulus 1 < |^| < 2, and C/ = C * /• Our Theorem is 

1.1. Theorem. The maximal operator HI f := sup|^|^;^|H^ C/l "fnaps into weak 
L?' , and into U for p > 2. 
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Figure 1: The function / is depicted by the circle centered at the origin with a planar 
wave in the vertical direction indicated by the two horizontal lines. In the shaded 



This theorem is a complement to a corresponding result, due to J. Bourgain [3], 
for the directional maximal function, namely 



See Section 3 for a discussion of norm bounds for this operator. For both this operator, 
and H*, the estimate of weak square integrability is sharp, as was pointed out to us 
by M. Christ [5]. This argument may summarized as follows. Begin with a Schwartz 
function (p > with frequency support in a small ball about the origin in the plane. 
Then consider f{xi,X2) := e^'^'^ ip{xi, X2). For any point in the plane x = {xi,X2) 
with |xi| > 2 and 1 < j^la^il, consider the line that passes through x and the 
origin. The real part of / will not change sign on this line, so we compute the Hilbert 
transform with no cancellation. That is H*/(x) ~ M*/(x) ~ And is 

just in weak L^. See Figure 1. 

Whereas, Bourgain's argument for the maximal estimate is not difficult, the The- 
orem above is of necessity somewhat harder, as it implies the pointwise convergence 
of Fourier series in one dimension. This is an observation in the style of De Leeuw. 
One considers the trace of the operator in frequency variables along any line in the 
annulus 1 < |^| < 2. This is Carleson's theorem, [1], but also see [(>]. As such, we use 
a method which is adopted from the proof of Carleson's Theorem given by M. Lacey 
and C. Thiele [12]. 

Perhaps the principal novelty of this paper is the suitable adaptation of the time 
frequency analysis of Lacey and Thiele to the current setting of the plane and mea- 
surable choice of directions. In comparison to the proof of Lacey and Thiele, we find 
a rather precise analogy between the proofs on the real line, and the proofs on the 
plane. Some differences arise from the notion of a tile, which requires some care to 



region, if^/ ~ 



(1.2) 
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define, and the proof of tlie "size Lemma," an ortliogonality statement, requires a 
small amount of innovation. 

Finally, there is an outstanding question, attributable to E. M. Stein [1 1], concern- 
ing the boundedness of the Hilbert transform on families of lines that are determined 
by say a Lipschitz map. Thus, for a map v : ^ {|a;| = 1}, one wishes to know if 

f f{x-yv{x)) — 

J-i y 

is a bounded operator on say L^. Positive results for analytic and real analytic vector 
fields are due to respectively Nagel, Stein and Wainger [13], and to Bourgain [i]. In 
a subsequent paper, the authors [i i] will prove that the operator above is bounded 
on if t> G C^+*^, for any positive e. The results of this paper are a crucial aspect of 
the proof of this result. 

The presentation of this paper has been substantially improved by skilled and 
generous referees. 



2. Definitions and Principle Lemma 



We begin with some conventions. We do not keep track of the value of generic 
absolute constants, instead using the notation A < B iS A < KB for some constant 
K. And A ~ S iff A < S and B < A. We use the notation Ia to denote the 
indicator function of the set A. And the Fourier transform on is denoted by 
/(^) = /ig2 e~^™''^ /(x) dx, with a similar definition on the real line. We use the 
notation 

-f f dx:= \A\-'^ I f dx. 

J A J A 

2.1. Definition. A grid is a collection of intervals Q so that for all I,J&Q, we 
have I n J E {0, /, J}. The dyadic intervals are a grid. A grid Q is central iff for all 
I, J E Q, with I J we have 100/ C J. 

The reader can find the details on how to construct such a central grid structure 
in [7]. 

Let p be rotation on T by an angle of — 7r/2. Coordinate axes for are a pair of 
unit orthogonal vectors {e,e±) with e = pe±. 

2.2. Definition. We say that uj cM.'^ is a rectangle if it is a product of intervals with 
respect to a choice of axes {e,ej_) o/M^. We will say that uj is an annular rectangle 
if oj = (— 2^""^, 2'~^) X (a, 2a) for an integer I with 2' < a/8, with respect to the axes 
(e, e±). The dimensions ofuo are said to he 2' xa. Notice that the face {—2^~^, 2^~^) xa 
is tangent to the circle \^\ = a at the midpoint to the face, (0,a). We say that the 
scale of u is scl(c<j) := 2' and that the annular parameter of u is ann(u;) := a. In 
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referring to the coordinate axes of an annular rectangle, we shall always mean (e, ej,) 
as above. 



It is imperative to keep in mind that the choice of basis in the definition of a tile 
depends upon the tile in question. These definitions are illustrated in Figure 2. Also 
see Figure 6. 

Annular rectangles will decompose our functions in the frequency variables. But 
our methods must be sensitive to spatial considerations; it is this and the uncertainty 
principle that motivate the next definition. 

2.3. Definition. Two rectangles R and R are said to be dual if they are rectangles 
with respect to the same basis (e, e±), thus R = ri x r2 and R = ri x r2 for intervals 
ri, ri, i = 1,2. Moreover, 1 < |rj| ■ |rj| < 2 for i = 1,2. The product of two dual 
rectangles we shall refer to as a phase rectangle. The first coordinate of a phase 
rectangle we think of as a frequency component and the second as a spatial component. 

We consider collections of phase rectangles AT which satisfy these conditions. For 
s,s' E AT we write s = Rg x Ug, and require that 



(2.4) Us is an annular rectangle 

(2.5) Rs and are dual rectangles, 

(2.6) The rectangles Rg are the product of intervals from a central grid. 

(2.7) {R : RxLUg e AT} partitions for all Ug. 

(2.8) ann(ti;s) = ann for some fixed ann, 

(2.9) i{ujg : scl(s) = 2\ ann(s) = ann} > cann2"', / G Z. 




(2.13) \uJs\<K- 



We assume that there are auxihary sets u>s,iJL>si,'^s2 C T associated to s — or more 
specifically Us — which satisfy these properties. 

(2.10) fi := {ujs,u}si,ij^s2 '■ s G AT} is a grid in T, 

(2.11) a;sina;s2 = 0, cj^ := hull(cj^i, a;s2), 

(2.12) u^si lies clockwise from lJs2 on T, 

scl(u;^) 
ann(ci;s) ' 

(2.14) {X : ^ G cuj C puJsi, 

(2.15) to each there is a cJ^ G ^2 such that 2iJs C uJ^ C 4:UJs- 

Recall that p is the rotation that takes e into e±. Thus, e^j^ G oj^i. See Figure 2. 

Note that \ujs\ > \uJsi\ > scl(ct;s)/ann(cc;s). Thus, e^j^ is in u^si, and u)s serves as "the 
angle of uncertainty associated to Rg." Let us be more precise about the geometric 
information encoded into the angle of uncertainty. Let Rg = Vg x r^^ be as above. 
Choose another set of coordinate axes (e', e'^) with e' G ujg and let R' be the product 
of the intervals and rs± in the new coordinate axes. Then K^^R' C Rg C KqR' 
for an absolute constant Kq > 1. 

We say that annular tiles are collections ^T(ann) satisfying the conditions (2.4) — 
(2.14) above. The constant ann > appears in (2.8) and (2.9). We extend the 
definition of e^^, e,j_L, ann(ci;) and scl(u;) to annular tiles in the obvious way, using the 
notation e^, eg±, ann(s) and scl(s). 

A phase rectangle will have two distinct functions associated to it. In order to 
define these functions, set 

TraUy f (x) := f (x — y) , ?/ G (Translation) 

Mod^ f{x) := e^«-^ /(x), ^ G (Modulation) 



6 



MICHAEL T. LACEY AND XIAOCHUN LI 



DiF^,xH./(-i,-2) := (Dilation). 

In the last display, < p < oo, Ri x R2 is a. rectangle, and the coordinates (xi,X2) 
are those of the rectangle. Note that the definition depends only on the side lengths 
of the rectangle, and not the location. And that it preserves norm. 

For a function ip and tile s G AT set 

ifs ■■= Modc(^,) Tc(R,) Dil^^ !f, 

where c(J) is the center of J. Below, we shall consider to be a Schwartz function 
for which 1^ > is supported in a small ball about the origin in M^, and is identically 
1 on another smaller ball around the origin. 

We introduce the tool to decompose the singular integral kernels, and a measurable 
function v : — {|x| = 1} that achieves the maximum in our operator, up to a 
factor of say ^. We consider a class of functions ipt, t > 0, so that 

(2.16) Each ipt is supported in frequency in [—6 — n, —9 + k]. 

(2.17) l^t(x)| <(l + |x|)~V'^ 

In these conditions, < k < 1 is a small fixed constant that need not concern us. In 
the top line, ^ is a fixed positive constant, that is only needed to ensure that (2.18) 
below is true. For the current paper, we will need only single choice of function ^, but 
for the authors' subsequent paper [11], countably many choices of ip will be needed. 

Define 

<Ps{x) := / ips{x-yv{x))i)s{y) dy 

(2.18) ''^ 

= lu,Av{x)) i (fisix - yv{x))ipsisy) dy. 
Jr 

(2.19) ips{y) := scl(s)^sci(s)(scl(s)?/). 

An essential feature of this definition is that the support of the integral is contained 
in the set {v{x) e uJs2}, a fact which can be routinely verified, for an appropriate 
choice of 6 in (2.16). See Figure 2. That is, we can insert the indicator l^^^^{v{x)) 
without loss of generality. The set (jJs2 serves to localize the vector field, while ujgi 
serves to identify the location of cps in the frequency coordinate. 

Remark: It seems likely that we could prove our theorem using the simpler defi- 
nition 0s = (1^,52 °^)v^s- This would be the natural analog of the decomposition used 
by Lacey and Thiele [12]. But, in our subsequent paper on this subject [11], we will 
need to consider a truncation of the Hilbert transform, which suggests the definition 
of (ps we have adopted above. We will also want to rely upon facts proved in this 
paper, to deduce our theorems concerning smooth vector fields. 




The model operators we consider are defined by 

seAT{ann) 

In this display, AT{anr\) := {s e AT : ann(s) = ann}. 

2.20. Lemma. Assume that the vector field is only measurable. The operator Ca„„^y 
extends to a bounded map from into weak L"^ , and into itself for 2 < p < oo. 
The norm of the operator is independent o/ann. 

We shall prove that Cann.i; maps into weak L^, with constant independent of ann 
and V. By duality, it suffices to show that for all / G fl L°° of norm one, and 
sets F C of finite measure 

(2.21) |(C3nn,./,M|< \{f,^s){<Ps,lF)\<\F\'/'. 

sG»4T(ann) 

By dilation invariance of the Ca„„^v with respect to powers of 2, we can further take 

r<iF|<2. 

For the case of i/, 2 < p < oo, we shall demonstrate the restricted type estimate 

|{Cann,.l£; > A}| < X-P\E\, 2 < p < OO. 

We need only consider this for A > 1, by the weak bound. Moreover, this inequahty 
follows from 

|(C3nn,.lE,lF)| < Yl \{^s:1e){(I>s:1f)\ 
seAT{ann) 

(2.22) < |F|{log(|F|/|F|)}, 9|F| < \E\. 

Observe that by dilation invariance, we can assume that l^l ~ 1. We also assume 
that the vector field v is defined only on F. 
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The proofs of (2.21) and (2.22) are given in Section 5, and follow lines of argument 
that are similar to the one dimensional case, as given by Lacey and Thiele [12] for 
the case p = 2, and as in the work of Grafakos, Tao, and Terwilleger [8] for the case 
of p > 2. Also see the survey by Lacey [10]. 

Remark. The results for p > 2 are available essentially because functions that are 
in are also locally square integrable. In one dimension, the proof of convergence of 
Fourier series for 1 < p < 2 is closely linked to the boundedness of the Hardy Little- 
wood maximal function. In the current setting, the corresponding maximal function 
would be the unbounded Kakeya maximal function. The relationship between maxi- 
mal functions and bounds for 1 < p < 2 has a subtle role to play in our companion 
paper [11]. 

3. A Maximal Function Estimate 

We outline a proof of the weak type estimate on for the directional maximal 
function M* given in (1.2). 

3.1. Proposition. For functions f G , and X> 0, we have the estimate 

m*f>x}\<x-^f\\i 

It suffices to give the same bound for the model operator 

-Mann,!,/ := SUp \ {f , (fs) (ps\- 
seAT{ann) 

It is a remark that this inequality is an immediate consequence of Lemma 2.20. The 
sum defining Cann,i> is in fact unconditional convergent as a sum over tiles. As a 
consequence the square function below, which trivially dominates A4ann,v, satisfies 
the weak type inequality. 

seAT{ann) 

There is however a more direct proof. The fundamental fact, a simplier instance 
of the Size Lemma, Lemma 5.5 above, is this. 

3.2. Lemma. Fix f E L"^ and A > 0. Let S C ^T(ann) be a collection of tiles which 
are pairwise incomparable with respect to the partial order ' and moreover for each 
s E S, \{f,(ps)\ > A a/ I -Rs I ■ Then it is the case that 

J2\^s\<X-'\\f\\l 

This is proved in e.g. Barrionuevo and Lacey [ ], or one can inspect the proof of 
Lemma 5.5 for a number of simplifications which apply in this case. 



MAXIMAL DIRECTIONAL HILBERT TRANSFORM 

It is then clear that the maximal function below maps into weak LP'. 

\{f.^s)\ , 

1; 



For t > 2, set 



Ai/:= sup ±R^ 

^T(ann) Vl-^*l 

Atf:= sup ^— ItRs 

^r(ann) ^/\Rs\ 

This also maps into weak with norm at most < t^. Indeed, given A > 0, let S 
denote the maximal tiles with |(/, (ps) \ > ^^/\Rs\■ Then, it is the case that 

{Ai/>A}C \jKtRs 

ses 

for an absolute constant K. Hence, the claimed bound follows from Lemma 3.2. 
Then observe that 

oo 
k=0 



4. Proof of Theorem 1.1 



We show that Lemma 2.20 implies our main result. Theorem 1.1. Indeed, the 
Hilbert transform in the direction of v is, in the limit, an appropriate average of 
the discrete sums formed in the previous section. Let A be a smooth radial function 
satisfying 

(4.1) l[|,|](|e|)<A<l[i,2](|ei) 

Let Xt{y) = t^\{ty), where we choose the dilation so that t is the Fourier parameter, 
as opposed to the more common definition t~'^X{y/t) where t is the spatial parameter. 

Let K be the distribution on M 

oo 
j=-co 

where ip is a Schwartz function with ip > supported in a small neighborhood of —1. 
In particular, the distribution 

[ 2'K{2'y)ds 
Jo 

is a non-zero multiple of the distribution associated with projection onto the negative 
frequencies on M, which is itself a linear combination of the identity operator and the 
Hilbert transform. 
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To prove Theorem 1.1 it suffices to demonstrate the same norm estimates for the 
hnear operators in which the distribution l/y is replaced by K{y), namely for 



(4.2) T, fix) = J Aann * f (x - yvix))K{y) dy 

in which ann > 0, and the measurable v{x) are arbitrary. 



For values of 2-' < ann, let 

Sj.ann/ = ^ {f,fs)fs 

sG.4T(ann) 
scl(s)=2J" 

T„./(x) = j fix- yv{x))2^^{2^y) dy 

Also, let Rot^ be the operation of rotation by angle r. The main point concerns the 
operator 

Sj-ann := limf Dil?o2-=)x[0 2-'') R-ot-r Tran_y 
(4.3) ^--iBox(y) ^ ' ^ ' • ^ 

Sj,ann TraUy Kotr DII^q x [0,2=) dy dr ds 

where Box(y) = [0,y]^ x [0, 27r] x [—1, 1]. Observe these points. (1) The integral is 
an average over dilations, rotations, and translations. Note that the averaging over 
dilations is done uniformly in all directions, and that we are implicitly averaging with 
respect to the Haar measure on M+, with the rationale being that the dilations we are 
using are a representation of that group. (2) Due to the assumption (2.7), the limit 
in Y will exist. If ujs were fixed we would only need to average over all translations 
indexed by a fixed rectangle. (3) The rectangles Ug fill a fixed percentage of the 
annulus, due the assumption (2.9). (4) The limit, applied to a Schwartz function / 
is seen to exist. 

4.4. Lemma. For each 2^ < ann, we have the identity 

Sj,ann Aann * / = c(j, ann)Aann * / 

where the constant c{j, am) satisfies c^^ < |c(j, ann)| < c, for some absolute constant 
c. 



To deduce bounds for (4.2), observe that Lemma 2.20 concerns norm bounds for 
the sums 

^c(j, ann)~^T 

j 

Of course the coefficients c(j, ann)~^ do not appear in Lemma 2.20. Yet the placement 
of the absolute values in (2.21) and (2.22) demonstrates that the sum is unconditional 
over tiles, so that we can impose an arbitrary bounded sequence of coefficients, as we 
have done here. 
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The same norm bounds hold for averages of these sums, such as the averages that 
occur in (4.3). Hence, by the Lemma just stated, we deduce the norm bounds for the 
operators in (4.2). It remains to prove our Lemma. 

Before proving Lemma 4.4, we record a simple lemma on convolutions. 
4.5. Lemma. Let (f and be real valued Schwartz functions on M^. Then, 

/ (/, Trauj^+m if) Trauy+m (j) dy = f * ^ 

Where ^x) = JJ + u) du. 

In particular, $ = (pcj). 

The proof is immediate. The sum and integral in question is in fact the integral 

f{z)(p{z - y)(j){x - y) dydz 
and one changes variables, u = z — y. 

Proof of Lemma 4-4- Fix uj E Q, and let 

seAT{ann) 

UJs=UJ 

Recall (2.7). Consider the limit 

:= lim Tran_y Tran^ dy 

This limit can be explicitly written as an integral which is as in Lemma 4.5 with 
an additional dilation. Thus, this limit is convolution with respect to where 

By choice of cp, it has non-zero Fourier transform, and is identically one on a small 
ball around the origin. Hence has Fourier transform with the same properties. 
The definition of Sj ann also incorporates an average over dilations and rotations. 
Concerning the average over rotations, recall (2.9), which states that we have an 
essentially maximal number of Ug of a given scale. Concerning the dilations, observe 
that 

Dilfo,). [F * [Dilfo,i/t)2 G] = [Dilfo,). F] * G. 
These observations lead to the conclusion that 

Sj.ann/ = c(j, 3 nn)<l> * / 



$ = /" f Dil[0 2t)2Rot^<l> rfrrft 

J-l J[0,2it] 
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Figure 5: On the left, the interval uJs and the axes (e^, e^j.). On the right a rectangle 
Rg. In gray, the axes and rectangle Rg are rotated by an amount less than the length 

of LOs. 

The leading constant in the top line satisfy the claimed bound c < |c(j, ann)| < c^^ 
for a positive constant c. The main point here is assumption (2.9). This function $ 
has Fourier transform that is identically constant for all ann < |,^| < 2a nn. The value 
of this constant depends only on the choice of (p. □ 

5. The Main Lemmas 

We need these notions associated to tiles and sets of tiles, all directly inspired 
by the proof of Carleson's theorem as given by Lacey and Thiele [12]. There are 
however some changes in the current context, due to the setting of the plane, and 
our frequency analysis in an annulus. 

An important heuristic is that given a tile s, the interval uJa serves as an "angle of 
uncertainty" for the rectangle Rg. The geometric consequence of this can be described 
as follows. Write Rs = Rsi x Rs2 in the coordinate axes (e^, e^j,). There is an absolute 
constant K so that the following holds. Suppose that (e, e±) is some other choice of 
coordinate axes with 

\es - e|, \es± - e±\ < \ujs\. 
Write Rs = Rsi x Rg2 but the choice of coordinate axes is {e,e±). Then, we have 
Rs C KRs and Rg C KRs. The implication is that Rg and Rs are essentially the 
same rectangles. See Figure 5. 

There is a natural partial order on tiles, but it does take some care to define. For a 
fixed tile s, let (e^, es±) be the coordinate system associated with Rs. While we have 
not done so to date, at this point we insist that the origin in each such coordinate 
system specify the same point in the plane. Let V be the set of the projections of 
all i?s's onto Cs and V± be the set of the projections of all i?s's onto es±- We have 
assumed that P is a central grid in (2.6). Namely, recalling Definition 2.1, any two 
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7c{Rs',es'±) I 



Tr{Rs,es') 



Figure 6: For tiles s < s', only showing possible relative positions of Rg and R'^, as 
well as the projections that define the partial order. 



intervals I ^ J m V which intersect, satisfy either 100/ C J, or lOOJ C /. The 
reader can find the details on how to construct such a central grid structure in [7]. 
As all tiles lie in a single annulus, the collection of intervals V_[_ consist of disjoint 
intervals of equal length that partition R. 

Let 7T{R,es) be the minimal interval in V containing the projection of R onto e^, 
and let 7r(i?, es±) be the minimal interval in V± containing the projection of R onto 
es±- 

For two tiles s, s', write s < s' if and only if ujg D LOgi, 7r{Rs, e^/) C n^Rs', e^/), and 
7i{Rs', es'±) = n{Rs', es'±). This is illustrated in Figure 5. 

We have been careful to define this partial order in such a way that it is transitive 
and that we have the conclusion 

(5.1) If X n oj^r X Rg/ ^ 0, then s and s' are comparable under '<'. 

Proof. We can assume that oJs D ^*^s', as these intervals of the circle come from a grid. 
And so it suffices to check that 7r(_Rs,es/) C 7i{Rs',es')- By the assumption (2.6), in 
which K is a small positive constant, we would have TT{Rs,es') C 1007r(i?s, e^j), and 
the last set is contained in 7i{Rs', e^/), by the central grid property. □ 

A tree is a collection of tiles T C ^T(ann), for which there is a (non-unique) tile 
ujt X i?T £ ^T(ann) with s < ujt y< Rt for all s G T. For j = 1, 2, call T a j-tree if 
the tiles {ujgi x Rs '■ s & T} are pairwise disjoint. 1-trees are especially important 
for us; Figure 6 presents an useful illustration of such a tree. 

Observe that for any tree T we have, due to our comments about intervals of 
uncertainty. 



Rs C KR' 



T, 



s e T. 
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Thus, the spatial locations of the tiles are localized. In addition, as C cjt, we 
can regard all rectangles Rg as having a fixed set of coordinate axes, namely those 
of the top. This is a key point of the "Tree Lemma" below. 

Fix a positive rapidly decreasing function x, and for rectangle R set 
(5.2) xr = Tranc(/j) Dil^ x 

dense (s) := sup / Xr^i dx, 

dense(S) := sup dense (s), 

sGS 

sh(S) := \^ Rs (the shadow of S ) 
ses 

A(T)^:= 5:^4^1^- Tisal-tree, 



21 1/2 



size(S):= sup Y.\{f , ^ 

TCS 

T is a 1-tree ^ 

Recall that we are to prove (2.21) and (2.22). In (2.21), ||/||2 ~ 1, |F| < 2, and v is 
defined only on F. 

The first two Lemmas concern density and size, and are the same form. The third 
Lemma, the 'Tree Lemma' relates size, density and trees. 

5.3. Lemma. Any S C ^T(ann) is the union of Sheavy CL'nd Sught satisfying these 
conditions. 

dense (Slight) < | dense (S). 
The collection Sheavy ^-5 a union of trees T G ^hoavy; with 

(5.4) |sh(T)| < dense(S)'V|. 



heavy 



5.5. Lemma. Any S C ^T(ann) is the union of Sbig and Sgmaii satisfying these 
conditions. 

size(Ssmaii) < I size (S). 
The collection Sbig is a union of trees T G Tbig, with 

(5.6) |sh(T)| <size(S)-l 

5.7. Lemma. For any tree T we have the estimate 

Y){f.^s){4>sAE)\ < dense(T)size(T)|i?T|. 



The first lemma has a proof which is essentially identical to the proof of the "mass 
lemma" in M. Lacey and C. Thiele [12]. We do not give the proof. The second lemma 
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follows the well established lines of proof, yet must introduce a new element or two 
to address the two dimensional setting. The complete proof is given. The proof of 
the last lemma is quite close to that of M. Lacey and C. Thiele [12]. We shall give a 
proof. 

The lemmas are combined in this way to prove (2.21), and hence Lemma 2.20 in 
the case of the weak estimate. Lemma 5.3 and Lemma 5.5 should be applied so 
that their principal estimates (5.4) and (5.6) are approximately equal. The density 
of ^T(ann) is at most a constant. The size of ^T(ann) is at most a constant times 
the norm of /. Thus, we can take the set of all tiles ^T(ann) and decompose it 
into sub collections Scr, a G {2" : n G Z}, so that is the union of trees T G 
such that 

Ter^ 

dense(S^) < min(l, cr""^), 
size(S<,) < a-^l\ 

Hence, it follows that 

5^|(/,VP.)(</>.,li.)| < 5^min(l,a-V-V2|i?^| 

s(i'&„ TeT 

< min(al/^a-l/2). 

This estimate is summable over a G {2" : n G Z} to an absolute constant. This 
completes the proof of (2.21). 

A small variation on this argument proves (2.22). In this instance, the function 
/ = 1e, with \E\ ~ 1, so that the size of ^T(ann) is < 1. And ^T(ann) is a union 
of sub collections So-, cr G {2" : n G N}, so that So- is the union of trees T E % 
satisfying 

J2\Rt\<cx\FI 

dense(So) < o-"\ 
size(S.) < min(l,(a|F|)-i/2). 

Hence, it follows that 

^|(le,VP.)(0.,M|<|F|i/2min(|F|V2,a-i/2). 

Recall that in this instance, \F\ < |. This estimate is summable over a G {2" : n G 
N} to |F||log|F||. This completes the proof of (2.22). Our proof of Lemma 2.20 is 
complete, aside from the proofs of the three Lemmas of this section. 
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6. The Size Lemma: Orthogonality 

We give the proof of Lemma 5.5. We find that the proof involves for the most 
part a standard argument in the hterature. See for instance the proof of the energy 
lemma in M. Lacey and C. Thiele [12]. Yet there is a point at which we will rely 
upon the strong maximal function, with respect to a choice of axes that is specified 
in a particular way by the set of tiles. 

We can assume that all u)a lie in a fixed half circle. Note that in 1-trees, that 
as the scale of tiles increase, the intervals uigi move off in a clockwise direction, as 
depicted in Figure 6. We take advantage of this in a specific way below. 

The initial step of the proof is to construct a collection of 1-trees T G 7^ and 
use them to construct the collection of trees Tbig. The process is inductive. Let 
size(S) = a. Initialize 

T+ := 0, Tbig = 0, 8^*°^^^ = S. 
While size(S''*°"'^) >a/2, select a 1-tree T G S'^*"^'^ such that 

(6.1) 5^|(/,^.)r>^|i?T|, 

sex 

|i?T| is maximal, and ojt is most clockwise. Then set t(T) C 8'^*°'^'^ to be the maximal 
(with respect to inclusion) tree in 8'^*°'^'^ with top o^t x Rt- Update 

r+ = r+U{T}, Tbig = Tbig U{r(T)}, gstock _ gstock _ 
When size (8'^'^°'^'^) < (t/2, set 8^*°'^'^ = T^maii, and the process stops. 
To conclude the Lemma, we need to show that 

(6.2) El^T|<a-^ 

TeT+ 

We have constructed these trees so that they satisfy a property very useful to the 
verification of this inequality. Suppose T 7^ T' G 7^ and s G T and s' G T' with 
Us C (jJs'i- Then R^^Rs' = 0. We refer to this property as "strong disjointness." To 
see that it holds, we have cjt C Us'i-, so that cjt lies in the clockwise direction from 
Wt'. Hence T was constructed before T'. Thus, if i?T H Rs' 7^ 0, we would conclude 
from (5.1) that s' < ljt x -Rt. Hence, s' G t(T), and so s' would have been removed 
from 8'^*°'^'^ and so could not be in T'. 

Adopt the notation -F(8') = J^ses'if^ fs)fs- Now observe that for 8+ = IJTeT+ 
we have 

= (/,^(S+)) 
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< ll/l|2||i^(S+)|h 

This follows by Caucliy-Schwarz. Recall that ||/||2 = 1. To conclude (6.2) we need 
only show that 

(6.3) \\F{S^)h<a[j2\RT\''^' 

Ter+ 



For s G T, set 

B=(s) = {/G S+-T : a;, = a;,.}, 
B(s) = {s' G S+ - T : a;, a;,/}. 

Note that if s' G S+ - T is such that (v?s, v^s') 7^ 0, then s' G B=(s) U B(s). And if 
s' ^ B=(s) U B(s), then {ips,ips') = 0. We estimate 



(6.4) ||i^(S+)||^< $^||F(T)||^ 

TGr+ 

(6.5) + 5^(/,¥..)(vp.,F(B=(s))) 

(6.6) + ^(/,^.)(^.,F(B(s))) 

SGS+ 

It is a routine matter to verify that for each 1-tree T G 7^, 

iii^(T)ii2<^ii?Tr/^ 

Therefore, the right hand side of (6.4) is no more than 

T&T+ Ter+ 

For the term (6.5), we use this estimate for tiles s, s' with ujs = <^s 

\{Vs,Vs')\ < Trane(fl,)Dil°j^x(c(/?s'))- 
This with Cauchy-Schwarz gives 

2i 1/2 



(6.5)< [5^|(/,^.)r5^|(^.,F(B4s))) 



ses+ SGS4 



Ter+ 

as required by (6.3). 
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As for(6.6), use Cauchy-Schwarz in s to bound it by 



5^ (/, (y.., F(B(.))) < [ 5^ I (/, P 5^ I F(B(.))) I 




-1 1/2 
2 ' 



TeT+ S6S+ 



We show that for each T G Ti 



(6.7) El(^-^(B(s)))r<a^|i?T 



This will complete the proof of (6.3). 




This is a consequence of the grid structure on the intervals ujg- Moreover, all tiles 
s G T U B have u^s ^ ^t- This has two implications. The first is that all rectangles 
Rs can be regarded as rectangles with respect to a fixed set of coordinate axes, those 
for the top of the tree. Let M denote the strong maximal function computed in these 
coordinate directions. 

The second, is that the strong disjointness condition applies to each pair of tiles in 
the collection B. This yields the essential observation that the rectangles {Rg : s G 
B} are pairwise disjoint, and do not intersect -Rt- 

Make a further diagonalization of the set B. Set Bi = {sGB : Rs C 4:Rt-}, and 



for /c > 1, set Bfc = {s G B : R^ C A'^R^, R^ <f. ^^'^^R^}. Let us point out that 




F(B,)||2<4V|i?Tr/'. 



^(Bfc)|<a5^X«.*li?,. 



Therefore, for any function g 




< a 




<a4'=|/2Tr/' 



9\\2 



Clearly, this implies (6.7) for Bi. 



For A; > 1, we can strengthen our inequality to the following. 



F(B,)|U.(4.-iK,)<4-i°'=|i?Tr/'. 
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Figure 7: A few possible tiles in a 1-tree. Rectangles ujg are on the left in different 
shades of gray. Possible locations of Rg are in the same shade of gray. 



We use this, together with the fact that the functions {(^g '■ s G T} are equivalent 
to an orthonormal basic sequence. Hence, 

But the functions {(fg '■ s G T} are highly concentrated in a neighborhood of Rt- In 
particular, for any function g, 

Clearly, this completes the proof of (6.7). 

Proof of Lemma 5.7. We may fix the vector field, and assume that the standard 
basis (e, e±) are the basis for the rectangle Rt- As a consequence, we can without 
loss of generality assume that this is the basis for all the tiles s G T, and we write 
Rs = Rs,e X Rs,ej^- Set 6 = dense (T) and a = size(T). For this proof, we set 

x(^) = Tran,(fi,)Dil^^, < p < oo. 

Let J7e be those maximal dyadic intervals J in M for which for 3 J does not contain 
an interval Rs^e for some s G T. This collection partitions M. Let J'e^^ be that partition 
of M into maximal dyadic intervals J such that -Rx.ex ^ 3 J. Let K, = Je^- For 
each rectangle K & set T(_ft", ±) to be those s G T for which ± log|i?s,e|/|-^e| > 0. 

Choose signs Eg G {±} such that 



seT sST 
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Let 



Estimate, without appeal to cancellation 

[ dx<a [ dx 

The rectangles Rg are smaller than those of K, and do not intersect it. For each 
X & K, the numbers X^r^\x) ^ 1; they decrease as dist(-Rs, i^') ■ scl(s) increases. 
The integral above is at most 

/ 1^4'^'''^'"'' I dx < 5(7 mindful, |i?T| sup sup X^r^\^))- 
This is summable over G /C to at most < (5cr|i?T|- 

If T{K, +) is non-empty, then K C ARt- Set 

GK:=Kn \J 

seTiK,+) 



which contains the support of 1kA'^^^^'~^\ Our assertion is that \Gk\ ^ ^I-^I- To 
see this, let K = x Ke±, and K'^ be the dyadic interval that contains and 
\K'^\ = 2\Ke\. This interval is small than that of RT,e- Then, by maximality, 3K'^ 
contains some Rs^e, for s G T. Let s' be the tile s < s' < ojt x Rt for which 
Rg' = K'^ X -Rt.ex- We have Gk C v~^{uJs>i). And since dense(T) < 6, we deduce 
the important inequality that 

\Gk\ < \Knv-\iVg,)\<6\K\. 

If T{K, +) is a 2-tree, then the sets {u}s2 '■ s G T{K, +)} are pairwise disjoint, so 
that the set v~^{u:s2) are either equal or disjoint. Hence 

Our desired bound Jj^\A^^^'^^\ dx < o"5|i^'| is immediate. 

If T{K, +) is a 1-tree, then all of the interval u:s2 contain ojt- Thus, for each 
x, there are e±{x) so that v{x) G ujs2 iff < scl(s) < £+(x). In particular, if 

v{x) G ujs2, then we have \v{x) — e| < £:_(x)/ann. This permits us to argue that the 
vector field v{x) can be assumed to be constant. Specifically one sees the inequality 

lu;.2(^(a;)) / \^six-yv{x))-(psix-ye)\\tpsiy)\dy<^—XRlix). 
Jr sci{s) 

This inequality is in fact a straight forward calculation, but one that depends very 
much on the uniform assumptions (2.17). 
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We make more definitions, and in the first definition, replace the vector field by e. 

:= j (fisix -ye)i)s{y) dy, 
Ak:= Y1 {f.^s)lu:Jv{xM, KeJC, 

In the last line, it is a matter of convenience to assume that all trees T(i^, +) are 
1-trees, and that T = lJ^g^T(i^, +). We have, by a straightforward estimate, 

so that it suffices to estimate the norm of A on the interval K. 

There are two points: The first is that for each K, Ak is dominated by a maximal 
function applied to B. The second is that B obeys the the inequality 

(6.8) \\B\\2<a\RT:\^'^ 



Let us turn to the second claim. The claim will follow immediately if we can see 
that the we have the universal estimate 

(6.9) lEo.^^ll.^siEi-''"" 

seT seT 



a. 



This holds for all numerical sequences {cts}. 

Indeed, the assumptions that we have placed on the functions namely that it 
have Fourier support in a small ball around the origin, and the uniform assumption 
on the Fourier support of the functions V's, as expressed in (2.16), imply that the 
functions (f)'^ are supported in Ug-^ As these rectangles are either equal or disjoint, it 
suffices to prove the estimate (6.9) when the tree T has Us = Us' for all s, s' G T. 
With this assumption, the assumption (2.17) (with the value of k, suitably small) 
implies the estimate 



si 



And this is more than enough to prove (6.9) under this restrictive assumption on the 
tree. 

Let us dominate Ak- For any choice of < s+, we have 

l{£_<scl{s)<£+}0l = (Ce- - Ce+) * 01 

in which we take C, to be non-negative Schwartz function on the plane satisfying 
l[-i/2,i/2]2 < C < l[-5/8,5/8] and set C,e{xe,Xe^) = £:annC(£:Xe, annXe^ ) . The identity 

^This calculation is available to us as we have replaced the vector field in the definition of 0^ by 
a fixed vector e. 
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follows from the frequency properties of Lp and of the class of functions il)t as described 
in (2.16). From this, we conclude that 

\^k\ < sup f \B{z) \ dz 
jdrJj 

with the last quantity being a supremum over all rectangles J that contain K. This 
supremum is constant on K. Thus, 

Ak\ dx < \Gk\ inf MB{x) < 5\K\ inf MB{x) 

x&K xGK 

where M denotes the strong maximal function with respect to the (e, ej,) coordinates. 
This last estimate is to be summed over K C 4i?T- 




<6\RTnMB\\2 
< 6ct\Rt\ 

This completes the the proof of the Tree Lemma. 
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